CS 730 R : Topics in Data and Information Management – Big Data Analytics
Authors
Abstract
The paper brings together two concepts: entity resolution (ER, also known as record linkage) and data privacy (DP). The authors sketch a framework for managing information leakage and study how it can be used to answer a variety of questions related to ER and DP. In particular, they study the problem of measuring the incremental leakage of critical information. The framework is based on the definition and use of two functions, match and merge. The match function detects attribute values that describe the same entity, while the merge function combines such values into a single record describing that entity. Calling these functions repeatedly builds up, step by step, the set of data disclosed about the described entity. The authors use disinformation as a mechanism to minimize information leakage. The paper presents a model of the problem, outlines the framework, explains the authors' motivation, and provides plenty of examples, but refers the reader to the technical report for details.
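The interplay of match and merge described above can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the paper or its technical report: the record layout, the key attributes, the rule that two records match when they share a value on a key attribute, the rule that merge unions attribute values, and the names `match`, `merge`, and `resolve`.

```python
from typing import Dict, FrozenSet, List, Tuple

# Assumed record layout: attribute name -> set of observed values.
Record = Dict[str, FrozenSet[str]]


def match(a: Record, b: Record, keys: Tuple[str, ...] = ("name", "phone")) -> bool:
    """Hypothetical match rule: the records share a value on some key attribute."""
    return any(a.get(k, frozenset()) & b.get(k, frozenset()) for k in keys)


def merge(a: Record, b: Record) -> Record:
    """Hypothetical merge rule: union the values of every attribute."""
    return {k: a.get(k, frozenset()) | b.get(k, frozenset())
            for k in a.keys() | b.keys()}


def resolve(records: List[Record]) -> List[Record]:
    """Naive resolution: keep merging a record with any match until none is left."""
    resolved: List[Record] = []
    for rec in records:
        current = rec
        merged = True
        while merged:
            merged = False
            for other in resolved:
                if match(current, other):
                    resolved.remove(other)
                    current = merge(current, other)
                    merged = True
                    break  # restart the scan with the enlarged record
        resolved.append(current)
    return resolved


# Invented sample data: three fragments about one person, one about another.
fragments: List[Record] = [
    {"name": frozenset({"Alice"}), "phone": frozenset({"555-0100"})},
    {"name": frozenset({"Alice"}), "zip": frozenset({"94305"})},
    {"phone": frozenset({"555-0100"}), "card": frozenset({"1234"})},
    {"name": frozenset({"Bob"}), "zip": frozenset({"10001"})},
]
entities = resolve(fragments)
```

In this sketch each merge enlarges the set of attributes known about an entity, which is the incremental build-up of disclosed data that the abstract describes. How leakage is actually quantified, and how disinformation counters it, is left to the paper and its technical report.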
Similar resources
Big Data Analytics and Now-casting: A Comprehensive Model for Eventuality of Forecasting and Predictive Policies of Policy-making Institutions
The ability of now-casting and eventuality is the most crucial and vital achievement of big data analytics in the area of policy-making. To recognize the trends and to render a real image of the current condition and alarming immediate indicators, the significance and the specific positions of big data in policy-making are undeniable. Moreover, the requirement for policy-making institutions to ...
Application of Big Data Analytics in Power Distribution Network
Smart grid enhances optimization in generation, distribution and consumption of the electricity by integrating information and communication technologies into the grid. Today, utilities are moving towards smart grid applications, most common one being deployment of smart meters in advanced metering infrastructure, and the first technical challenge they face is the huge volume of data generated ...
CS 730 R : Topics in Data and Information Management – Big Data Analytics
In this paper the authors introduce differential computation, a generalization of current techniques for incremental computation. They also introduce a definition of differential data, which shows a practical application of differential computation in parallel settings. The motivation presented by the authors emphasizes the performance of the new approach, which is very high especially for dynamica...
P-V-L Deep: A Big Data Analytics Solution for Now-casting in Monetary Policy
The development of new technologies has confronted the entire domain of science and industry with issues of big data's scalability as well as its integration with the purpose of forecasting analytics in its life cycle. In predictive analytics, the forecast of near-future and recent past - or in other words, the now-casting - is the continuous study of real-time events and constantly updated whe...
Big Data Quality: From Content to Context
Over the last 20 years, and particularly with the advent of Big Data and analytics, the research area around Data and Information Quality (DIQ) is still a fast growing research area. There are many views and streams in DIQ research, generally aiming at improving the effectiveness of decision making in organizations. Although there are a lot of researches aimed at clarifying the role of BIG data...